#################
## README FILE ##
#################

## GENERAL INFORATION
Journal: Perspectives on Politics.
Paper title: Enhancing Transparency and Replicability in Data Collection: Lessons from the Construction of Three Education Datasets.
Authors: Adrián del Río, Wooseok Kim, Carl Henrik Knutsen, Anja Neundorf, Agustina S. Paglayan, and Eugenia Nazrullaeva.

This document describes all code and data used to replicate the results and figures in the manuscript and online appendix.



## FILES
- "replication.R" contains the R code for replicating all analyses in the manuscript.
- "data.RDS": main data file.
- "data_coder.RDS": coder-level data.
- "vdem13.RDS": V-Dem v13 dataset.
- "acd.RDS": armed conflict data.
- "ciri.RDS": Cingranelli-Richards human rights data.
- "cv.RDS": Correlates of War data.
- "pts.RDS": Political Terror Scale data.



## SOFTWARE AND INSTRUCTIONS
R version 4.5.1.

Figures are stored in the folder "outputs".
Other results used for the figures are stored in the folder "outputs/results".

The replication code generates the following files in the manuscript and appendix:
- Figure 1. Comparisons of Different Measures of the Same Concept.
- Figure 2. Education/Curriculum Centralization (EPSM, V-Indoc, and HEQ).
- Figure 3. Trends in politicized teacher recruitment (V-Indoc and HEQ).
- Figure 4. Religious Instruction in Primary Schools (EPSM and HEQ).
- Figure A1. Experts’ confidence levels to code education-related questions over time at recruitment (top panel).
- Figure B1. ATT Estimates: Democracy and Education Centralization.
- Figure B2. ATT Estimates: Democracy and Curriculum Centralization.
- Figure C1. Number of coders in V-Indoc for five countries over time.
- Figure C2. Distribution of V-Indoc coders’ confidence levels.





